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Cn ' Abstract. We describe factor frequencies of the generalized Thue- Morse word tj,,™ defined for 

fe > 2, m > 1, 6, TTi G N, as the fixed point starting in of the morphism 

(fii,m.k -^ k{k + l)...{k + b-l), 

l/^ ' where k £ {0, 1, ... ,m — 1} and where the letters are expressed modulo m. We use the result 

^SJ ' of Frid [4] and the study of generalized Thue-Morse words by Starosta [8] . 

Q ■ 1. Introduction 
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The generahzed Thue-Morse word t^m is defined for b > 2, m > 1, 6, m,€N, as the fixed 



r^ • point starting in of the morphism 

^ ■ fb,m-k -^ k{k + l)...{k + b-l), 

where k E {0,1,... ,m — 1} and where the letters are expressed modulo m. Naturally, the 
class of generalized Thue-Morse words contains the famous Thue-Morse word t2,2 whose factor 
^ ■ frequencies have been determined by Dekking [3]. 

cn [ Generalized Thue-Morse words belong to the class of circular fixed points of uniform marked 

^P ' primitive morphisms. For such a class, Frid [4] has provided an algorithm for the computation of 

if-^ . factor frequencies. We recall her algorithm in Section 1. The aim of this paper is to describe the 

I • I set of frequencies of factors in t^ ^ of length n for every n S N. The most direct way is to apply 

(~^ ■ Frid's algorithm. However, there is even an easier way thanks to the knowledge of reduced Rauzy 

graphs (obtained from the description of bispecial factors by Starosta [8] ) and the invariance of 
the generalized Thue-Morse word under symmetries preserving factor frequencies. In Section 2, 
we define reduced Rauzy graphs and their relation to factor frequencies. In Section 3, we explain 
k>( \ what a symmetry is and how it preserves factor frequencies. The main result is presented in 

^ ■ Section 4, where we combine Frid's algorithm, reduced Rauzy graphs, and symmetries in order 



to get factor frequencies of generalized Thue-Morse words. Recently, an optimal upper bound 
on the number of factor frequencies in infinite words whose language is invariant under more 
symmetries has been derived in [2]. The generalized Thue-Morse word is an example of infinite 
words for which the upper bound is not attained, as shown in Section 5. 

We ask the reader to consult Preliminaries of the paper Generalized Thue-Morse words and 
palindromic richness by Starosta [8] for undefined terms. 

2. Factor frequencies of fixed points of circular marked uniform morphisms 

If tD is a factor of an infinite word u and if the following limit exists 

#|occurrences of w in v} 
hm — , 

\v\^oo,v£C{u) \v\ 
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then it is denoted by p{w) and called the frequency of w. It is known [7] that factor frequencies 
of fixed points of primitive morphisms exist. Generalized Thue-Morse words are fixed points of 
primitive morphisms, therefore we limit our considerations in the sequel to primitive morphisms. 
Let us recall first a result of Frid [4] , which is useful for the calculation of factor frequencies 
in fixed points of primitive morphisms. In order to introduce the result, we need some further 
notions. Let y? be a morphism on A* = {ai,a2, . . . ,am}*- We associate with (p the incidence 
matrix M^p given by [M^p]ij = |c/?(aj)|(i-, where |(/3(aj)|a- denotes the number of occurrences of a^ 
in ip{aj). The morphism if is called primitive if there exists fc € N satisfying that the power M^ 
has all entries strictly positive. As shown in [7], for fixed points of primitive morphisms, 

• factor frequencies exist, 

• it follows from the Perron- Frobenius theorem that the incidence matrix has one dominant 
eigenvalue A which is larger than the modulus of any other eigenvalue, 

• the components of the unique eigenvector (xi, X2, . . . , Xm)'^ corresponding to A normal- 
ized so that YlT^i^i — 1 coincide with the letter frequencies, i.e., Xi = p{ai) for all 
ie{l,2,...,m}. 

Let (f he a morphism on A*. We denote ipij : A'^ —^ A'^, where i,j G N, the mapping that 
associates with v £ A'^ the word ipij{v) obtained from (p{v) by erasing i letters from the left 
and j letters from the right, where i+ j < \ip{v)\. We say that a word v G A'^ admits an 
interpretation s = (pQhi . . .h„L-,i-,j) if v = ipij {bobi . . . bm) , where bk & A and i < |v?(6o)[ and 
j < \ip{bm)\- The word a{s) = &o^i ... &m is an ancestor of s. The set of all interpretations of 
V is denoted I{v). Now, we can recall the result of Frid for factor frequencies of fixed points of 
primitive morphisms. 

Proposition 2.1. Let f be a primitive morphism having a fixed point u and let A be the dominant 
eigenvalue of the incidence matrix M^. Then for any factor v G /^(u), it holds 

(1) p{v) = \y1 /'(«(^))- 

For circular fixed points of uniform marked primitive morphisms, the algorithm of Frid [4] 
provides the possible frequencies of factors of a given length and for every frequency, the number 
of factors having that frequency. In order to describe her algorithm, we have to recall several 
notions. A morphism if defined on the alphabet A is called uniform if all images of letters are 
of the same length b, i.e., |v'(ct)| = b for all a £ A. In the case of a uniform primitive morphism 
(f, the dominant eigenvalue of the incidence matrix M^ is A = 6. A morphism is called marked 
if every pair of distinct letter images differs both in the first letter and in the last letter. Let 
u be a fixed point of a morphism f defined on A, then its factor w contains a synchronization 
point {wi,W2) ii w = W1W2 and for every fi,V2 £ A* and for every factor s of u, there exists 
factors si, S2 of u such that the following implication holds 

V1WV2 = (p{s) =^ S = S1S2, ViWi = (p{si), W2V2 = ip{s2)- 

In other words, a synchronization point marks a boundary between letter images in every oc- 
currence of w in u. Any factor tt; of u that contains a synchronization point is called circular. 
We call a fixed point u of a morphism (p circular (with synchronization delay L) if every factor 
w of length greater than or equal to L is circular. For uniform marked primitive morphisms, 
Proposition 2.1 takes the following form. 

Proposition 2.2. Let v be a circular word of a uniform marked primitive morphism 99 with the 
letter image length b, then there exists a unique interpretation of v. Moreover, if we denote the 
unique ancestor of v by w, then p{v) = E^^. 
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We define the structure ordering number K for fixed points of circular uniform morphisms as 
the least integer satisfying b{K — 1) + 1 > L, where h is the length of letter images and L is 
the synchronization delay. The following statements are to be found in [4] as Proposition 4 and 
Theorem 5. 

Proposition 2.3. Letn > K , there exists a unique triplet of decomposition parameters {p{n), k{n), A(n)), 
where p{n) G N, k{n) e {K,... , b{K - 1)}, and A(n) G {1, . . . , b^W}, such that 

n = y'(")(A;(n)-l) + A(n). 

The explicit formulae read p(n) = log^ ;^j^ —l,k{n)= -g^ , A{n) = n — ¥'^^^\k{n) — l). 

Theorem 2.4. Let u be a circular fixed point of a uniform marked primitive morphism f. 
Denote /^n(u) = {v^ ,V2 , ■ ■ ■ ;^c7 h' ^^^ ^^^ n> K, the set /^„+i(u) can be partitioned into 

(1) C{k{n) + 1) groups of A{n) words each, every word in the jth group having the frequency 

^Pl^f"^""'), J €{!,..., C(M^) + 1)}, 

(2) C{k{n)) groups of b^^"-' — A(n) words each, every word in the jth group having the fre- 
quency ^Pl^f"-*), J G {!,..., C(/c(n))}. 

The frequencies p^v, ), /c € {K, . . . , b{K — 1) + 1}, can be found directly using (1). Theo- 
rem 2.4 provides then explicit formulae for factor frequencies of circular fixed points of uniform 
marked primitive morphisms. 

2.1. Reduced Rauzy graphs. Assume throughout this section that factor frequencies of in- 
finite words in question exist. The Rauzy graph of order n of an infinite word u is a directed 
graph r„ whose set of vertices is Cn{u) and set of edges is Cn+i{u). An edge e = wqWi . . .Wn 
starts in the vertex w = wqWi . . . Wn-i, ends in the vertex v = wi . . . Wn~iWn, and is labeled by 
its factor frequency p{e). 

It is easy to see that edge frequencies in a Rauzy graph r„ behave similarly as the current in 
a circuit. We may formulate an analogy of the Kirchhoff 's current law: the sum of frequencies 
of edges ending in a vertex equals the sum of frequencies of edges starting in this vertex. 

Observation 2.5 (KirchhofF's law for frequencies). Let w be a factor of an infinite word u 
whose factor frequencies exist. Then 

P{w) = ^ piO'W) = Y^ p{wa). 

a6Lcxt(ui) aGRcxt(ui) 

The KirchhofF's law for frequencies has some useful consequences. 

Corollary 2.6. Let w be a factor of an infinite word u whose frequency exists. 

• If w has a unique right extension a, then p{w) = p{wa) . 

• If w has a unique left extension a, then p{w) = p{aw). 

Corollary 2.7. Let w be a factor of an aperiodic recurrent infinite word u whose frequency 
exists. Let v be the shortest BS factor containing w, then p{w) = p{v). 

The assumption of recurrence and aperiodicity in Corollary 2.7 is needed in order to ensure 
that every factor can be extended to a BS factor. 

Corollary 2.6 implies that if a Rauzy graph contains a vertex w with only one incoming edge 
aw and one outgoing edge wb, then p := p{aw) = p{w) = p{wb) = p{awh). Therefore, we can 
replace this triplet (edge-vertex-edge) with only one edge awb keeping the frequency p. If we 
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reduce the Rauzy graph step by step applying the above described procedure, we obtain the 
so-called reduced Rauzy graph Tn, which simplifies the investigation of edge frequencies. In order 
to precise this construction, we introduce the notion of a simple path. 

Definition 2.8. Let r„ be the Rauzy graph of order n of an infinite word u whose factor 
frequencies exist. A factor e of length larger than n such that its prefix and its suffix of length n 
are special factors and e does not contain any other special factors is called a simple path. We 
define the label of a simple path e as p{e) . 

Definition 2.9. The reduced Rauzy graph T^ of u of order n is a directed graph whose set of 
vertices is formed by LS and RS factors of Cn{vi) and whose set of edges is given in the following 
way. Vertices w and v are connected with an edge e if there exists in Tn a simple path starting 
in w and ending in v. We assign to such an edge e the label of the corresponding simple path. 

Remark 2.10. According to Corollary 2.6 and Definition 2.9, if u is an aperiodic recurrent 
infinite word whose factor frequencies exist, it holds for every n G N, 

{p(e) I e G £„+i(u)} = {p{e) \ e edge in Tn}. 

Considering Corollary 2.7 and Definition 2.9, one may observe the following. 

Observation 2.11. Let u be an aperiodic recurrent infinite word whose factor frequencies exist. 
If we find to a reduced Rauzy graph Tn none of whose vertices is a BS factor the reduced Rauzy 
graph of minimal larger order, say T^, containing a vertex being a BS factor, then 

{/9(e) I e edge in Tn} = {pie) \ e edge in Tm} U {p{v) \ v vertex in Fm.}- 
3. Symmetries preserving factor frequency 

We will be interested in symmetries preserving in a certain way factor occurrences in u and 
consequently, frequencies of factors of u. Let us call a symmetry on A* any mapping ^ satisfying 
the following two properties: 

(1) If is a bijection: A* ^ A* , 

(2) for al\w,v eA* 

^{occurrences of w m. v} = ^{occurrences of ^{vo) in ^[v]}. 

The following statements are taken from [2]. 

Theorem 3.1. Let ^i : A* ^ A* . Then '^ is a symmetry if and only if ^ is a morphism or an 
antimorphism such that ^ is a letter permutation when restricted to A. 

Observation 3.2. Let u be an infinite word whose language is invariant under a symmetry ^. 
For every w in C{vl) whose frequency exists, it holds 

p{w)=p{^{w)). 

We denote the set of all morphisms and antimorphisms on A* by AM {A*). 

Theorem 3.3. Let G C AM [A*) be a finite group containing an antimorphism and let u be 
a uniformly recurrent aperiodic infinite word whose language is invariant under all elements of 
G and such that the frequency p{w) exists for every factor w € ^(u). Then there exists M G N 
such that 

ff{p{e)\eeCn+i{u)} < ^(4(C(n + l)-C(n))+#G-X-y) foralln>M, 

where X is the number of BS factors of length n and Y is the number of BS factors of length n 
that are 9 -palindromes for an antimorphism 9 ^ G. 
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4. Factor frequencies of generalized Thue-Morse words 

The generalized Thue-Morse word t{,^m is defined for b > 2, m > 1, 6, m € N, as the fixed 
point starting in of the morphism 

(2) ipb^^:k ^ k{k + l)...{k + b-l), 

where k € Z^ = {0, 1, ... ,m — 1} and where the letters are expressed modulo m. We denote 
by q the smallest positive integer satisfying q(b — 1) = mod m. The word tf, ,„ is periodic if 
and only if 6 = 1 mod m (see [1]). In this case, tb,m = (01 • • • (m — 1))^ , where oj denotes an 
infinite repetition. It is thus readily seen that any factor of t;,,„ has its frequency equal to — . 
Properties of ipb,m and th^m'- 

(1) <pb,m is primitive, therefore letter frequencies exist and are equal to the components of 
the eigenvector —(1, 1, . . . , 1) of the incidence matrix corresponding to the dominant 
eigenvalue b, 

(2) (fb,m is uniform {\Lpb,rnik)\ = b for all k € Z^,), 

(3) ^b,m is marked, 

(4) tb^m is circular with synchronization delay L = 2b 

Proof. Any w G jC-(tb^m) of length greater than or equal to 26 contains either for some 
k,i (z Zm a factor ki, where I ^ k + 1 mod m, or is of length 2b and of the form 
w = k{k + 1) . . . (/c + 26 — 1) for some k G Z^. 

(a) In the first case, it is easy to see that k marks the end of wi and I the beginning of 
W2 in the synchronization point {wi.,W2) oiw. 

(b) In the second case, (tt;, e) is a synchronization point of ti; = k{k + 1) . . . (/c + 26 — 1). 

D 

(5) C{tb^m) is invariant under Dm = \J^x \ x G Zm}U{^a; | x G Z^}, where n^^ is a morphism 
and ^x an antimorphism defined for all k G Z^, by 

Tlx{k) = x + k, 
^xik) = X - k. 

For the proof see [8] . 

The aim of this section is to describe {/9(e) | e G Cn+i{'tb,m)} for all n G N. Theorem 2.4 gives 
explicit formulae for factor frequencies if the frequencies of factors of length nG {!,... ,26+1} 
(the structure ordering number K = 3 for tb^m) are known. There is even an easier way to get 
factor frequencies using symmetries of C{tb^m) and the description of BS factors from [8]. 

Proposition 4.1. If v is a BS factor of C{tb,m) of length greater than or equal to 2b, then there 
exists a BS factor w G C{tb^m) such that v = ipb,m{w). Moreover, p{v) = l-^. 

Proof. The first part has been proved as Lemma 3 in [8]. The second part follows from Propo- 
sition 2.2. D 

Reduced Rauzy graph method (RRG method) 

We get the frequencies {/5(e) | e G Cn+i{tb,m)} for all n G N in the following way. 

(1) We describe reduced Rauzy graphs of order n, where 1 < n < 26 — 1, making use of the 
invariance of C{tb^m) under symmetries. We notice that all of them contain a BS factor 
as a vertex. 

(2) Proposition 4.1 says that every BS factor is of length b n, £ G N, where n G {1, . . . , 26—1}. 
It is not difficult to see that all reduced Rauzy graphs of order greater than or equal 
to 26 containing a BS factor as their vertex are obtained by a repeated application of 
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ifb,m simultaneously to vertices and edges of reduced Rauzy graphs of order n, where 
2 < n <2b — l. By Proposition 4.1, the reduced Rauzy graph of order nb obtained when 
ipb,m is applied i times to the reduced Rauzy graph of order n, where 2 < n < 25 — 1, 
satisfies 

{p{e) I e edge in F^f^t} = {^p{e) \ e edge in f„}, 
{p{v) I V BS vertex in T^^^} = {-^p{v) \ v BS vertex in r„}. 

(3) Applying Observation 2.11, we obtain 

(a) If (n - 1)6^ < N < nb^ for some n G {2, . . . , 26}, then 

{p(e) I e G CN+iitb^m)} = {7£P(e) | e edge in f„} U {iipiv) \ v BS vertex in f„}. 

(b) If A^ = nb^ for some n € {2, . . . , 26 — 1}, then 

{p(e) I e € /:Ar+i(t6,,„)} = {j^p{e) \ e edge in f„}. 
Example 4.2. Xei us illustrate the RRG method for the Thue-Morse word t2 2- 



n = 1 n = 2 n = 3 



0110 1001 



r^; 






1100 



Figure 1. Reduced Rauzy graphs of t2,2 of order n G {1, 2, 3}. 



fi.- p{0) = p{l) = i and {/9(e) | e edge in fi} = {i, i}. 
E'xp/anaizon.- 

• Thanks to Observation 3.2, we have p{0) = p{l), /9(01) = p(10), and p{00) = p{ll)- 

• Using Property (1) of (pb,m (ind tb^m, we get p{0) = ^. 

• By Corollary 2.6 and Proposition 2.2, it holds /9(00) = /9(1001) = ^p(lO). 

• Applying the KirchhojJ's law for frequencies, we get p{0) = p{01) + p{00) = ^p{01), 
consequently p(01) = i. 

r2-' p(01) = p{lO) = I and {p{e) \ e edge in T2} = {g}. 
Explanation: 

• Thanks to Observation 3.2, we have p(OlO) = p(lOl) and p(OllO) = p(lOOl). 

• By Proposition 2.2, it holds p(OllO) = \p{Ql) = \. 

• Applying the Kirchhoff's law for frequencies, we get p{01) = p(OlO) + p(OllO). 
Therefore p(OlO) = |. 

fa; p{OW) = p(lOl) = i and {p{e) \ e edge in fa} = {i, ^}. 
Explanation: 

• Thanks to Observation 3.2, we have p{01l) = p(lOO) = p{00l) = p{llO), p(OOll) = 
p(llOO), p(OlOl) = p(lOlO), p(OOlO) = p(llOl) = /9(1011) = p(OlOO). 



By Corollary 2.6 and Proposition 2.2, it holds p(OOlO) = /9(100101) = ^^(100) 
ip(lOOl) = ^ and p{mil) = /9(100110) = ip(lOl) = ^. 
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• The KirchhojJ 's law for frequencies implies p{OWl) = p(OlO) — p{OWO) = j^- 
(2) All reduced Rauzy graphs of order greater than or equal to 4 containing a BS factor as 
their vertex are depicted in Figure 2. It holds for all k £N 



'f (01) 



V (0110) 




(1001) 



■f-- (10) 




(0011) 



V (Oil) *7 



(001) 



1011) f '^^^iOIOI) 1^..' (oQio) 
'p'' (101) <p' (010) 



(1001) 



5* (1101) I 

/ (110) 




<p' (1100) 



Figure 2. For any reduced Rauzy graph of t2,2 of order n > 4 containing a BS 
factor as its vertex, there exists i > 1 such that the graph takes one of the 
depicted forms. 



{/9(e) I e edge in f 2.2'?} 
{p{v) V BS vertex in T2-2'} 



l2« 6J' 



and 



{ 



2f 6' 2^ 12-f' 



J> 



i^f ei- 



{p(e) I e edfi-e in T^.21} 
{p{v) I f BS vertex in T^.2e} — 121^^ 
(3) The sets of factor frequencies {p{e) | e € /^Ar+i(t2,2)} ^^'"e 0/ i/ie following form for 

N en. 

(a) {p{e)\eeC^{t2,2)} = {\}. 

(h) {p{e)\eeC2{t2,2)} = {\,l}. 

(c) // 2 • 2^ < iV < 3 • 2^ for some ^ G N, then 

r 1 1 1 1 

\2^6'2^T2 
("d; // 3 • 2*^ < TV < 4 • 2*^ /or some £ G N, i/ien 

1 1 



Me)|eG£^+i(t2,2)} = ^^-,^- U^- 



Me) |eG/:w+i(t2,2)} = |^^-jU^2£+i3 

(e) IfN = 2- 2^ /or some ^ G N, i/ien 

{p(e) I e G CN+iitb,m)} 

(f) IfN = 3- 2^ for some ^ G N, i/ien 

{p(e) I e G £Af+i(tfo,m)} = 



1111 

2^+i3'2^+^6 



1 1 

2^6 



1111 
2^6'2^12 

The RRG method says that it suffices to describe frequencies of edges and vertices being BS 
factors in reduced Rauzy graphs of order n, where 1 < n < 26 — 1, in order to get {p{e) | e G 
^n+ii^)} for ah n G N. Using the description of BS factors from [8], we obtain the form of 
reduced Rauzy graphs for 1 < n < 26 — 1. 

(1) For 1 < n < 6, the reduced Rauzy graph r„ has m vertices. All of them are BS factors 
of the form k{k + 1) . . . (/c + n — 1). Since each of them is equal to 11^(01 ... (n — 1), their 
frequencies are the same. Moreover, 
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• e is an edge ending in 01 ... (n — 1) if and only if liki^) is an edge ending in 
k{k + l)...{k + n-l), 

• e is an edge ending in 01 ... (n — 1) if and only if ^k+n-ii^) is an edge starting in 
k{k + l)... (k + n-l), 

and since p{e) = p(Ilk{e)) = /9(^fc+„_i(e)), it suffices to describe frequencies of edges 
ending in 01 ... (n — 1) in order to get all edge frequencies of r„. As shown in [8], 
Lext(01 . . . (n - 1)) = {-1 + A;(6 - 1) I A; € {0, 1, . . . , g - 1}}. 

Lemma 4.3. Denote f = p(01). Then f = -^^jfl-ri and for 1 <n <b, the frequencies 
of the vertex tj; = 01 . . . (n — 1) and of the edges ending in w satisfy 

P(0) = h 
/9(01...(n-l)) = (n-1)/-^ forn>2, 

p((-l)01...(n-l)) = nf-^, 

p{{-l + k{b - 1))01 . . . {n - 1)) = ^f forke{l,...,q-l}. 

Proof. Let us proceed by induction on n. Let n = 1, then p{0) = — by Property (1) of 
ipb,m and tfj^m- It holds by Corollary 2.6 and Proposition 2.2 for k G {1, . . . ,q — 1} that 

p((-l + k{b - 1))0) = p{^{-l + {k-l){b- 1))0)) = ^p((-l + (fe - 1)(6 - 1))0). 

Thus pii-1 + kib - 1))0) = ^pii-m = ^p(n_i(01)) = ^/. Using Observation 2.5, 
we obtain / = p(0) — Ylt=i W- Therefore 

6«-i b - 1 
m 6"? — 1 









/ 


Let 1 < n + 1 < 6. Assume 




Pii- 

9{{-l + k{b- 


p(01. 
-1)01. 
1))01. 




1)) 
1)) 
1)) 



(n - 1)/ - ^^ for n > 2, 

nf-^^=^, 

Wf foTke{l,...,q-l}. 

Then, p{01 ...n) = /9(n_i(01 . . . n)) = /9((-l)01 . . . (n - 1)) = n/ - ^. Applying 
Corollary 2.6, we get p{{-l + k{b - 1))01 . . . n) = p((-l + k{b - 1))01 . . . (n - 1)) = ^/. 
Using the Kirchhoff's law for frequencies (Observation 2.5), we have /)((— 1)01 . . . n) = 

(2) For 6+1 < n < 26 — 1, the reduced Rauzy graph r„ has 3m vertices: m of them 
are BS factors of the form 11^(01 . . . (n — 1)), k G Z^, m of them are LS factors of 
the form n/;(01 ... (6 — 1)1 . . . (n — 6)), k G Z^, ^ti of them are RS factors obtained by 
applying ^o to LS factors. Since symmetries preserve frequencies, all BS factors have 
their frequency equal to p{01 ... (n — 1)) and similarly, all LS and RS factors have their 
frequency equal to p{01 ... (6 — 1)1 . . . (n — 6)). By analogous arguments as in part (1), 
we deduce that it suffices to describe frequencies of edges ending in 01 ... (n — 1) and in 
01 ... (6— 1)1 ... (n — 6) and the frequency of the unique edge 01 ... (6— 1)1 ... (n + 1 — 6) 
starting in 01 ... (6 — 1)1 ... (n — 6) in order to get all edge frequencies of r„. Again 
by [8], 

Lext(01...(n-1)) = {-1,6-2}, 
Lext(01...(6-l)l...(n-6)) = {-1 + A;(6 - 1) | A: G {0, 1, . . . ,g - 1}}. 

Lemma 4.4. Denote f = p(01). Then for b+l<n<2b— 1, the frequencies 



p(01. 


..{n- 


-1)) 


y9((-l)01., 


..(n- 


-1)) 


'{b-2)01., 


..(n- 


-1)) 
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(a) of the BS vertex ti; = 01 . . . (n — 1) and of the edges ending in w satisfy 

P(01...(n-1)) = ^/-^^^/, 
p((-l)01...(n-l)) = ^/-lli^/, 
K(6-2)01...(n-l)) = ^/, 

(b) of the edge 01 ... (6 — 1)1 ... (n + 1 — h) starting in the LS vertex v = 01 ... (6 — 
1)1 ... (n — 6) and of the edges ending in v satisfy 

p{01...{b-l)l...{n + l-b)) = If, 

p((-l)01...(6-l)l...(n-6)) = j,f, 

p((-l + (6-l))01...(fe-l)l...(n-6)) = 1(2/ -ij), 

p{{-l + k{b - l))Ol... {b- 1)1... (n-b)) = ^f forke{2,...,q-l}. 

Proof. Let us proceed by induction on n. 

(a) Let n = 6+1, then using part (1), we obtain p(01 . . . 6) = p(n_i(01 . . . 6)) = 
/9((-l)01 . . . (6 - 1)) = 6/ - ^ = ^/. By Corollary 2.6, Proposition 2.2, and Obser- 
vation 3.2, we have p((6 - 2)01 ...b) = p{ip{i-l)Ob)) = p(v3^((l - 6)6)) = ^p((l - 6)6) = 
^p{Um-bi{l - 6)6)) = ^p((-l + iq- 2)(6 - 1))0) = ^/. Finally, applying Observa- 
tion 2.5, we get />((-l)0 ...b) = ^f - ^f. 
Let 6+l<n + l<26-l. Assume 

1 f _ (n-b-l) J. 

1 f _ ("-^) f 

fog/- 

Then, p(01 ...n)= p(n„i(01 . . . n)) = p((-l)01 . . . (n - 1)) = ^/ - (^/. Applying 
Corollary 2.6, we get p((6 - 2)01 . . . n) = /)((6 - 2)01 ... (n - 1)) = ^/. Using the 
KirchhofF's law for frequencies (Observation 2.5), we have /?((— 1)01 . . .n) = m^t/ — 

jn-b) r L f — 1 f _ (n+l-b) ^ 

bi J bi J 69-1/ ^9 /• 

(6) Let n = 6 + 1, then by Corollary 2.6 and Proposition 2.2, it follows p{01 . . . (6 — 
1)12) = p{ip{01)) = ^f. Again, by Corollary 2.6 and Proposition 2.2, it holds for 
ke {2,...,g-l}thatp((-l + A:(6-l))01...(6-l)l) = /9(99((-l + (A;- 1)(6- 1))01)) = 
^p((— 1 + {k — 1)(6 — 1))01) = ^/, and for A: = 1, we have by the same arguments 
p{{-l + (6 - 1))01 ... (6 - 1)1) = /9(<p((-l)01)) = ip((-l)01) = i(2/ - i). Finally, by 
the Kirchhoff's law for frequencies (Observation 2.5), we derive p{{—l)01 ... (6 — 1)1) = 

bJ ~ by^J ~ m) ~ /-^k=2 bk J ~ IflJ- 

Let 6+l<n + l<26-l. Assume 

p(01...(6-l)l...(n + l-6)) = 1/, 

p((-l)01...(6-l)l...(n-6)) = ^/, 

p((-l + (6-l))01...(6-l)l...(n-6)) = i(2/-i), 

p((-l + A:(6-l))01...(6-l)l...(n-6)) = ^/ for fc G {2, . . . ,g - 1}. 

By Corollary 2.6, we have /9(01 . . . (6-1)1 . . . (n+2-6)) = p(01 ... (6-1)1 .. . (n+1-6)) = 
\f. Again by Corollary 2.6, we get for ah A; G {2, . . . , ^ - 1}, p((-l + k{b - 1)01 . . . (6 - 
1)1 ... (n + 1 - 6)) = /j((-l + A:(6 - 1)01 ... (6 - 1)1 ... (n - 6)) = i/, and analogously, 
p{{-l + (6 - 1)01 ... (6 - 1)1 ... (n + 1 - 6)) = p((-l + (6 - 1)01 ... (6 - 1)1 ... (n - 
6)) = -^(2/ — — ). Using the Kirchhoff's law for frequencies (Observation 2.5), we have 
/>((-l)01 ... (6™- 1)1 ... (n + 1 - 6)) = 1/ - 1(2/ - i,) - YV2 wf = hf- ° 
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Theorem 4.5. Let b > 2,m > l,b,m G N, and b ^ 1 mod m. Let t^^m be the fixed point 
starting in of the morphisni (ph^m defined in (2) . Then the sets of factor frequencies take the 
following form for A^ G N. 



N 


Me) eGA,,„(iV+l)} 





m. 


1 


^, w/zere /c G {0, . . . ,g - 1} 


(n - 1)6^ <N < nb\ £ G N, 
where n G {3, . . . , 6} 


u;/iere A; G {1, . . . ,g — 1} 


(n - 1)6^ <N < nb\ £ G N, 
where n G {6 + 1, . . . , 26 - 1} 


1 /" 1 f ("-^-1) f\ 1 / 1 f {«-b) f\ 1 fof 1 \ 1 / 1 f \ 

FV69-1/ 69 Jjibiybi-iJ m Jj^b^+iy^J m>'¥\WJ)' 
where k £{!,... ,q} 


(26-1)6^ <Af< 26^+1, ^gN 


5^+1 (2/ „) , ^i+i {y^f), 
where A; G {0, . . . , (7 — 1} 


n6^ ^ G N, 

w/iere n G {2, . . . , 6} 


where k £ {I, . . . ,q — 1} 


n6^£GN, 

w/iere n G {6 + 1, . . . , 26 - 1} 


where k £ {1,. . . ,q} 



Proof The statement is obtained when putting together Lemmas 4.3 and 4.4 and step (3) of 
the RRG method. D 



5. Upper bound on frequencies 

In the last section, let us show and explain that the optimal upper bound on the number 
of factor frequencies in infinite words whose language is invariant under more symmetries, here 
recalled as Theorem 3.3, is not reached for large enough n for any generalized Thue- Morse word 
tfe,m with b > 2,m > 1,6, tti G N, and 6 7^ 1 mod m. First of all, the upper bound cannot 
be attained for q > 2 : since q corresponds to the number of extensions of special factors, the 
estimate #{w G >Cn(u) | w RS} < C{n + l)—C{n) = X]^g£„(u)(#R'6xt(w) — l), used in the proof 
of Theorem 3.3, is too rough for q > 2. Indeed, the upper bound is greater than or equal to 
2q — 2 (we have used the result C{n + 1) — C{n) > qm — m from [8]), while applying Theorem 4.5, 
we can see that the number of frequencies of factors of the same length is at most g + 3. 

Nevertheless, even in the case of g = 2, if we take n = (26 — 1)6^ for any I G N, then ^{p{e) I 
e G Cttmi'^T- + 1)} = 3 = g + 1 by Theorem 4.5. By the description of factor complexity from [8], 
we have C{n + 1) — C{n) = qm = 2m. It follows from Properties of 'Pb,m a-^d t^^m summarized 
in Section 4 that ^G = 2m and the number of BS factors of length n is equal to m and is the 
same as the number of BS factors being ^-palindromes for some antimorphism ^^^ x G 
Therefore, the upper bound from Theorem 3.3 is equal to g— ( Sttt, + 2m 



m 



m 



4. Hence, 
for any M G N, the equality in the upper bound from Theorem 3.3 is not reached for all n > M. 
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Let us explain the reason. In the proof of Theorem 3.3, we have used the invariance of C{tb,m) 
under symmetries in order to obtain the upper bound on the number of factor frequencies. 
However, some factors may have the same frequency for another reason. We observe as a direct 
consequence of Corollary 2.6 the following. 

Observation 5.1. If w is a BS factor of an infinite word u such that for every a € hext{w), 
there exists a unique b G Rext(ti;) satisfying awb € i3(u) (let us call such BS factors weakj, then 
p{aw) = p{awb) = p{wb). 

For n = 26—1, the BS factor of the form w = 01 . . . (26 — 2) is weak: w can be extended in only 
two ways, as (6 — 2)w{2b — 1) and as {—l)wb. Hence, p{{—l)w) = p{wb) even if these factors are 
not symmetric images of each other. Similarly, the BS factor f = c/?^ m(^) *^^ length n = (26— 1)6 
is weak and p{av) = p{vb), where a is the last letter of vim^—l), i-e., a = — 1 + £(6 — 1). It holds 
again that av and vb are not symmetric images of one another. 
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